Learning Comprehensible Theories from Structured Data

نویسنده

  • John W. Lloyd
چکیده

This thesis is concerned with the problem of learning comprehensible theories from structured data and covers primarily classification and regression learning. The basic knowledge representation language is set around a polymorphically-typed, higherorder logic. The general setup is closely related to the learning from propositionalized knowledge and learning from interpretations settings in Inductive Logic Programming. Individuals (also called instances) are represented as terms in the logic. A grammar-like construct called a predicate rewrite system is used to define features in the form of predicates that individuals may or may not satisfy. For learning, decisiontree algorithms of various kinds are adopted. The scope of the thesis spans both theory and practice. On the theoretical side, I study in this thesis 1. the representational power of different function classes and relationships between them; 2. the sample complexity of some commonly-used predicate classes, particularly those involving sets and multisets; 3. the computational complexity of various optimization problems associated with learning and algorithms for solving them; and 4. the (efficient) learnability of different function classes in the PAC and agnostic PAC models. On the practical side, the usefulness of the learning system developed is demontrated with applications in two important domains: bioinformatics and intelligent agents. Specifically, the following are covered in this thesis: 1. a solution to a benchmark multiple-instance learning problem and some useful lessons that can be drawn from it; 2. a successful attempt on a knowledge discovery problem in predictive toxicology, one that can serve as another proof-of-concept that real chemical knowledge can be obtained using symbolic learning; 3. a reworking of an exercise in relational reinforcement learning and some new insights and techniques we learned for this interesting problem; and 4. a general approach for personalizing user agents that takes full advantage of symbolic learning.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Logic for learning - learning comprehensible theories from structured data

Give us 5 minutes and we will show you the best book to read today. This is it, the logic for learning learning comprehensible theories from structured data 1st edition that will be your best choice for better reading book. Your five times will not spend wasted by reading this website. You can take the book as a source to make better concept. Referring the books that can be situated with your n...

متن کامل

Alkemy: A Learning System based on an Expressive Knowledge Representation Formalism

This paper describes the design and analysis of a system developed to learn comprehensible theories from structured data. The underlying knowledge representation formalism is a polymorphically-typed higher-order logic. To model structured data, a class of terms suitable for representing a wide range of data is identified. To encode structural boolean features, a class of predicates that can be ...

متن کامل

An Inductive Learning System for XML Documents

This paper presents a complete inductive learning system that aims to produce comprehensible theories for XML document classifications. The knowledge representation method is based on a higherorder logic formalism which is particularly suitable for structured-data learning systems. A systematic way of generating predicates is also given. The learning algorithm of the system is a modified standa...

متن کامل

A Theoretical Analysis on Structured Learning with Noisy Data and its Applications

Abstract: Performances of supervised machine learning will be affected by noises of labeled data badly, which has been well studied by existing theories about learning with noisy data. However these theories only focus on two-class classification problems. In this paper we studied the relation between noise examples and their effects on structured learning. Firstly, we found that noises of labe...

متن کامل

A Tutorial Introduction to Alkemy

This paper serves as a tutorial introduction to Alkemy, a decision-tree learner designed to induce comprehensible theories from structured data. A short and clear account of the motivations behind the development of the system is in [FGCL98]. [BGCL01] gives a more detailed discussion of the relevant knowledge representation issues and presents the main algorithms. A rigorous treatment of the lo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002